Efficient Low-Redundancy Codes for Correcting Multiple Deletions
نویسندگان
چکیده
We consider the problem of constructing binary codes to recover from k–bit deletions with efficient encoding/decoding, for a fixed k. The single deletion case is well understood, with the VarshamovTenengolts-Levenshtein code from 1965 giving an asymptotically optimal construction with ≈ 2n/n codewords of length n, i.e., at most logn bits of redundancy. However, even for the case of two deletions, there was no known explicit construction with redundancy less than nΩ(1). For any fixed k, we construct a binary code with ck logn redundancy that can be decoded from k deletions in Ok(n log 4 n) time. The coefficient ck can be taken to be O(k2 logk), which is only quadratically worse than the optimal, non-constructive bound of O(k). We also indicate how to modify this code to allow for a combination of up to k insertions and deletions. We also note that among linear codes capable of correcting k deletions, the (k+ 1)-fold repetition code is essentially the best possible.
منابع مشابه
Codes Correcting Two Deletions
In this work, we investigate the problem of constructing codes capable of correcting two deletions. In particular, we construct a code that requires redundancy approximately 8 log2 n+O(log2 log2 n) bits of redundancy, where n denotes the length of the code. To the best of the author’s knowledge, this represents the best known construction in that it requires the lowest number of redundant bits ...
متن کاملCorrecting Bursty and Localized Deletions Using Guess & Check Codes
We consider the problem of constructing binary codes for correcting deletions that are localized within certain parts of the codeword that are unknown a priori. The model that we study is when δ ≤ w deletions occur in a window of size at most w bits. These δ deletions are not necessarily consecutive, but are restricted to the window of size w. The localized deletions model is a generalization o...
متن کاملCodes for Correcting a Burst of Deletions or Insertions
This paper studies codes which correct bursts of deletions. Namely, a binary code will be called a b-burst-correcting code if it can correct a deletion of any b consecutive bits. While the lower bound on the redundancy of such codes was shown by Levenshtein to be asymptotically log(n) + b− 1, the redundancy of the best code construction by Cheng et al. is b(log(n/b+1)). In this paper, we close ...
متن کاملBounds on Codes Correcting Tandem and Palindromic Duplications
In this work, we derive upper bounds on the cardinality of tandem and palindromic duplication correcting codes by deriving the generalized sphere packing bound for these error types. We first prove that an upper bound for tandem or palindromic deletions is also an upper bound for inserting the respective type of duplications. Therefore, we derive the bounds based on these special deletions as t...
متن کاملGuess & Check Codes for Deletions, Insertions, and Synchronization
We consider the problem of constructing codes that can correct δ deletions occurring in an arbitrary binary string of length n bits. Varshamov-Tenengolts (VT) codes are zeroerror single deletion (δ = 1) correcting codes, and have an asymptotically optimal redundancy. Finding similar codes for δ ≥ 2 deletions is an open problem. We propose a new family of codes, that we call Guess & Check (GC) c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Electronic Colloquium on Computational Complexity (ECCC)
دوره 22 شماره
صفحات -
تاریخ انتشار 2015